Computer survey for likely genes in the one megabase contiguous genomic sequence data of Synechocystis sp. strain PCC6803.

نویسندگان

  • M Hirosawa
  • T Kaneko
  • S Tabata
  • J D McIninch
  • W S Hayes
  • M Borodovsky
  • K Isono
چکیده

Using the computer program GeneMark, the open reading frames (ORFs) previously assigned within the one megabase sequence data of the genome of the cyanobacterium, Synechocystis sp. strain PCC6803 (Kaneko et al., DNA Res. 2: 153-166, 1995), were re-examined. Matrices required by GeneMark for its statistical calculation were generated and modified by running a script termed GeneMark-Genesis that performed recursive application of GeneMark against the Synechocystis data and evaluated the probability scores for optimization. Based on the matrices thus generated, 752 of the 818 previously assigned ORFs (92%) were supported by GeneMark as likely coding sequences, of which 26 were predicted to start at more internal positions than previously assigned. In addition, 50 ORFs were newly identified as likely coding sequences, most of them being shorter than 300 bp. Thus, the procedure was proven to be very powerful to locate likely coding regions within the genomic sequence data of Synechocystis without having prior information concerning their similarity to the genes of other organisms. However, GeneMark did not predict 66 previously assigned ORFs as likely genes: 14 of them showed significant degrees of similarity to known genes and 10 others were found within IS-like elements. It seems that these genes, many of which appear to be exogenous origin, escaped detection by GeneMark as in the case of "class 3 (horizontally transferred) genes" of E. coli, which in turn suggests that genes of different phylogenetic origins might also be detected as such by modifying the matrices.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Prediction Rate of Coding Regions is Enhanced upto 99.15 % by Joint Use of GeneMark-RC and GeneHacker in Case of a Cyanobacterium

The advancement in large-scale sequencing has accelerated the production of long contiguous nucleotide sequence data. The whole genomic sequence data is currently available for several prokaryotic organisms. The rst step in the analysis of genomic sequence data is to assign coding regions, which is absolutely necessary for a comparative study of one organism with the others and to elucidate com...

متن کامل

Combining two genomes in one cell: stable cloning of the Synechocystis PCC6803 genome in the Bacillus subtilis 168 genome.

Cloning the whole 3.5-megabase (Mb) genome of the photosynthetic bacterium Synechocystis PCC6803 into the 4.2-Mb genome of the mesophilic bacterium Bacillus subtilis 168 resulted in a 7.7-Mb composite genome. We succeeded in such unprecedented large-size cloning by progressively assembling and editing contiguous DNA regions that cover the entire Synechocystis genome. The strain containing the t...

متن کامل

CyanoBase, the genome database for Synechocystis sp. strain PCC6803: status for the year 2000

CyanoBase provides an online resource for access to data on genomic information about the cyanobacterium Synechocystis sp. strain PCC6803. The database contains annotations for each protein-coding gene deduced from the entire nucleotide sequence of the genome, gene classification lists, and keyword and similarity search engines. Core portions of CyanoBase consist of annotations for each of the ...

متن کامل

Study of Light Wavelength Dependency in Red-Orange Spectrum on Continuous Culture of Synechocystis sp. PCC6803

In this study, the effect of light wavelength on growth rate and lipid production of Synechocystis was investigated. Continuous cultivation system was used to have uniform cell density and avoid self-shading in order to obtain more precise results. Based on previous studies, red light is more efficient than other colors in the visible spectrum for cultivation of Synechocystis; however, the opti...

متن کامل

Glycogen Synthase Isoforms in Synechocystis sp. PCC6803: Identification of Different Roles to Produce Glycogen by Targeted Mutagenesis

Synechocystis sp. PCC6803 belongs to cyanobacteria which carry out photosynthesis and has recently become of interest due to the evolutionary link between bacteria and plant species. Similar to other bacteria, the primary carbohydrate storage source of Synechocystis sp. PCC6803 is glycogen. While most bacteria are not known to have any isoforms of glycogen synthase, analysis of the genomic DNA ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • DNA research : an international journal for rapid publication of reports on genes and genomes

دوره 2 6  شماره 

صفحات  -

تاریخ انتشار 1995